An extensive comparative study of cluster validity indices
نویسندگان
چکیده
The validation of the results obtained by clustering algorithms is a fundamental part of the clustering process. The most used approaches for cluster validation are based on internal cluster validity indices. Although many indices have been proposed, there is no recent extensive comparative study of their performance. In this paper we show the results of an experimental work that compares 30 cluster validity indices in many different environments with different characteristics. These results can serve as a guideline for selecting the most suitable index for each possible application and provide a deep insight into the performance differences between the currently available indices. & 2012 Elsevier Ltd. All rights reserved.
منابع مشابه
Comparative study on proximity indices for cluster analysis of gene expression time series
In the computational analysis of gene expression time series, the main aspect in finding co-expressed genes is the proximity (similarity or dissimilarity) index used in the clustering method. In this context, the proximity indices should find genes that have similar patterns of expression change through time. There are a number of proximity indices used for such a task. However, the majority of...
متن کاملA Comparative Study of Hard and Fuzzy Data Clustering Algorithms with Cluster Validity Indices
Data clustering is one of the important data mining methods. It is a process of finding classes of a data set with most similarity in the same class and most dissimilarity between different classes. The well known hard clustering algorithm (K -means) and Fuzzy clustering algorithm (FCM) are mostly based on Euclidean distance measure. In this paper, a comparative study of these algorithms with d...
متن کاملEXCLUVIS: A MATLAB GUI Software for Comparative Study of Clustering and Visualization of Gene Expression Data
The result of one clustering algorithm varies from that of another for the same input dataset as the input parameters of an algorithms can substantially affect the behavior and execution of the algorithms. Cluster validity measures can be used to find the partitioning that best fits the underlying data. In most realistic applications, this analysis can be visualized using simple Computer-Aided-...
متن کاملThe study of the Reliability and Validity of Personal Responsibility Scale for Adolescents
The purpose of the present study was to investigate the validity and reliability of the Personal Responsibility Scale for Adolescents (Mergler & Shield, 2016). The statistical population consisted of all first and second year high school students in Firoozabad city who were studying in the year 1396-97. Participants were 300 high school students (111 girls and 189 boys) who were selected by clu...
متن کاملImproving Cluster Method Quality by Validity Indices
Clustering attempts to discover significant groups present in a data set. It is an unsupervised process. It is difficult to define when a clustering result is acceptable. Thus, several clustering validity indices are developed to evaluate the quality of clustering algorithms results. In this paper, we propose to improve the quality of a clustering algorithm called ”CLUSTER” by using a validity ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Pattern Recognition
دوره 46 شماره
صفحات -
تاریخ انتشار 2013